Lazy Propositionalisation for Relational Learning

نویسندگان

  • Érick Alphonse
  • Céline Rouveirol
چکیده

A number of Inductive Logic Programming (ILP) systems have addressed the problem of learning First Order Logic (FOL) discriminant definitions by first reformulating the FOL learning problem into an attribute-value one and then applying efficient learning techniques dedicated to this simpler formalism. The complexity of such propositionalisation methods is now in the size of the reformulated problem which is exponential when tackling determinate relational problems. We propose a method that selectively propositionalises the FOL training set by interleaving attribute-value reformulation and algebraic resolution. It avoids, as much as possible, the generation of reformulated examples which are not relevant wrt the discrimination task, and still ensures that explicit correct and complete definitions are learned. We present an AQ-like algorithm exploiting this lazy propositionalisation method and then provide a first empirical evaluation on a standard benchmark dataset for ILP, the Mutagenesis problem.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Good and Bad Practices in Propositionalisation

Data is mainly available in relational formats, so relational data mining receives a lot of interest. Propositionalisation consists in changing the representation of relational data in order to apply usual attribute-value learning systems. Data mining practitioners are not necessarily aware of existing works and try to propositionalise by hand. Unfortunately there exists some tempting pitfalls....

متن کامل

Lazy and Eager Relational Learning Using Graph-Kernels

Machine learning systems can be distinguished along two dimensions. The first is concerned with whether they deal with a feature based (propositional) or a relational representation; the second with the use of eager or lazy learning techniques. The advantage of relational learning is that it can capture structural information. We compare several machine learning techniques along these two dimen...

متن کامل

Approaching the ILP 2005 Challenge: Class-Conditional Bayesian Propositionalization for Genetic Classification

This report presents a statistical propositionalisation approach to relational classification and probability estimation on the genetic ILP Challenge domain. The main difference between our and existing propositionalisation approaches is its ability to construct features from categorical attributes with many possible values and in particular the object identifiers. Our classification and rankin...

متن کامل

Involving Aggregate Functions in Multi-relational Search

The fact that data is scattered over many tables causes many problems in the practice of data mining. To deal with this problem, one either constructs a single table by propositionalisation, or uses a Multi-Relational Data Mining algorithm. In either case, one has to deal with the non-determinacy of one-to-many relationships. In propositionalisation, aggregate functions have already proven to b...

متن کامل

Classification of symbolic objects: A lazy learning approach

Symbolic data analysis aims at generalizing some standard statistical data mining methods, such as those developed for classification tasks, to the case of symbolic objects (SOs). These objects synthesize information concerning a group of individuals of a population, eventually stored in a relational database, and ensure confidentiality of original data. Classifying SOs is an important task in ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000